
What is claimed is: 

1 . A method of analyzing a first^nucleic sample comprising: 
providing said first nucleicyacid sample; 

reproducibly reducing the/complexity of said first nucleic acid sample to produce a 
second nucleic acid sample which may comprise a plurality of non-identical sequences 
whereby said second nucleic acid sample is obtainable by: 

fragmenting^&id first nucleic acid sample to produce fragments and ligating 
adaptor sequences to said fragments; 

fragmenting said first nucleic acid sample to produce fragments, denaturing 
said fragments<^il^ing"sbme of said fragments to reanneal to form double stranded 
DNA seqjiraces am removing said double stranded DNA sequences, 
amplification by arbitrarily primed PCR; 

hybrMzing said first nucleic apid sample to an oligonucleotide probe bound 
to a/solid suppcM^ 

^^Mdl^ngsa^ acid sequence to a mismatch binding protein; 

jroyiding a nucleic acid array; 

hybridizing said second nucleic acid sample to said array; and 

/ ... 
analyzing a hybridization pattern resulting from said hybridization. 

/ 

2. The method of claim 1 wherein said second nucleic acid sample comprises at least 
0.5 % of saicl nucleic acid sample 

/ 

3. /The method of claim 1 wherein said second nucleic acid sample comprises at 
least 3 %/of said nucleic acid sample 


4. The method of claim 1 wherein said second nucleic acid sample comprises at 

/ 

least 12 % of said nucleic acid sample 
/ 

at least 12% 
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5. The method of claim 1 waerein said second nucleic acid sample comprises at 
least 50 % of said nucleic acid sample 

6. The method of claim l/wherein each of said non-identical sequences differs 
from the other non-identical sequences by at least 5 nucleic acid bases. 

7. The method of claiin 1 wherein each of said non-identical sequences differs 
from the other non-identical sequences by at least 10 nucleic acid bases. 


8. The method of daim 1 wherein Sach of said non- identical sequences differs from 
the other non-identical sequences bjf at least 50 nucleic acid bases. 


i 


9. The 7 method of claim l'wherein each of said non-identical sequences differs from 
the other non-identical sequences fby at least 1000 nucleic acid bases. 

' // v y 

l(i). The method a'f claim l\^vherein said NA^ample is DNA. 



ethod of claim 1 wherein said NA sample is genomic DNA. 


12. The method of claim 1 wherein said first nucleic acid sample is cDNA derived 

/ 

from RNA or iriRNA. 


13. The method of claim 1 further comprising the step of amplifying at least one of 
the non-identical sequences in said second nucleic acid sample. 


14. The method of claim 13 wherein said step of amplifying is performed by a 
polymerase chmn reaction (PCR). 


41 


15. The method of claim 1 wherein th^ entire method is performed in a single 
reaction vessel. 

16. The method of claim 1 wherein said step of fragmenting the first nucleic acid 
5 sample comprises digestion with at least one restriction enzyme. 

17. The method of claim 1 wherejn said step of fragmenting the first nucleic acid 
sample comprises digestion with a type lis endonuclease. 


10 18. The method of claim 1 whpreirN^aifl adaptor sequences comprise PCR primer 

sp template sequences. 

o 

: p 19. The method o^ claim 1 v^herein ^kid^daptor seqji6nces comprise tag 

□ 

sequences. 

s . 

□ 20. The method of claim 1! wherein said solid support is a magnetic bead. 

^ i 

IU 21 . The method of clainj 1 wherein said mismatch binding protein is bound to a 

magnetic bead. 

20 

22. The method of claim 1 wherein said method for analyzing a nucleic acid 
sample comprises determining whether the nucleic acid sample contains sequence 
variations. 

25 23. The method of plaim 22 wherein said sequence variations are single nucleotide 

polymorphisms. 

24. The method of claim 1 wherein the step of obtaining a DNA array comprises: 



designing a DNA array to query DNA fragments which have been produced by the 
identical procedures used to obtain said second nucleic acid sample. 

25. The method of claim 24 wnerein the step of designing further requires 
5 predetermining the sequences container in said second nucleic acid sample. 
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□ 

'-15 


l J 5 

o 
I" 


26. The method of claim 
contained in said second nucleic acid 


wherein said step of predetermining the sequences 
ample is conducted in a computer system. 


27. The method of claim 23 \* 


by: 


ansaid second nucleic acid sample is obtainable 


binding oligonucleotide probes containing a desired SNP sequence to magnetic 

/ V 
beads to form probe-bead complexes; and 

hybridizing said probe-bead complexes to said DNA sample^ 

exposing said hybridized DNA sample to a single stran&DNA nuclease to remove 
single stranded DljlA thereby forming a Dk^^duplex;^ 

ligating a (KH^lg^tf^ided adaptor sequence comprising a restriction enzyme site to 
said DNA duplex; 

digesting said DNA duplex wi 


ih a restriction enzyme to release the magnetic bead; 


20 and 


isolating only those fragments containing said SNP sequence. 


endonuclease 


28. The method of claim 25 wherein said restriction enzyme is a Class lis 


25 


by: 


29. The method of claim 23 wherein said second nucleic acid sample is obtainable 

exposing the DNA sample to a mismatch bonding protein; 
employing a 3' to 5* exormclease to remove single stranded DNA; and 
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employing a nuclease to r ;move single stranded DNA. 

30. A method of screening for DjhJA sequence variations in an individual 
comprising: 


providing said first nucleic acid ; 
providing a second nucleic acid s 


ample from said individual; 
ample by reproducibly reducing the complexity of 
said first nucleic acid sample to produce a second nucleic acid sample which may 
comprise a plurality of non-identical sequences whereby said second nucleic acid 
sample is obtainable by: 

fragmenting said first nucleic acid sample to produce fragments and ligating 
adaptor sequences to said fragments 
fragmenting said first n 


1 


cleic acid sample to produce fragments, denaturing 


said fragments, allowing some ofsaidfragments to reanneal to form double stranded 
DNA sequences and removifig said doubtfe stranded DNA sequences, 
amplification by arbitra ilWprimed PCR; 

hybridizing said first nucjl|c acid sample to an oligonucleotide probe bound 
to a solid support; 

hybridizing! saidjEjr^t nucleic acitteetfuence to a mismatch binding protein; 
providing a nucleic acid array; \ 

hybridizing said second nuclei ; acid sample to said array; and 
analyzing a hybridization pattern resulting from said hybridization. 


31. The method of claim 30 wherein said sequence variation is a SNP. 


32. The method of claim 3 1 wherein said SNP is associated with a disease. 


33. The method of claim 3 1 wherein said SNP is associated with the efficacy of a 

drug. 


34. A method of screening for EJNA sequence variations in a population of 
individuals comprising: 

providing said a first nucleic acifl sample from each of said individuals; 
providing a second nucleic acid/sample by reproducibly reducing the complexity of 
said first nucleic acid sample to produce a second nucleic acid sample which may 
comprise a plurality of non-identical sequences whereby said second nucleic acid 
sample is obtainable by: f 

fragmenting said first nucleic acid sample to produce fragments and ligating 
adaptor sequences to said fragH^rts; ^\ 

fragmenting said first nucfleic acid sample to produce fragments, denaturing 
said fragments, allowufg some of spd fragments to reanngaHo form double stranded 
DNA sequences and/removing said dou^straridedlJNA sequences, 
amplification by arbitrarily primed PCR; 

hybridizmg^said first nucleic acid sample to an oligonucleotide probe bound 
to a solid support; I 

hybridizing said first nucleic acid sequence to a mismatch binding protein; 
providing a nucleic acid array; 

hybridizing said second nucleic acid sample to said array; and 

I 

analyzing a hybridization pattern resulting from said hybridization. 


35. The method of claim 34 further comprising the step of compiling the analyses 
of each individual's hybridization pattern. 


36. The method of daim 34 wherein said sequence variation is a SNP. 

37. In a computer system, a method of designing an array comprising: 


4^ 4^ 


modeling specific enzymaticrg actions between a known nucleic acid sequence and 
an enzyme; 

obtaining the result^ of s^dtfn&€tei63 enzymatic reactions; 
obtaining probe sequences based upon said results; and 
designing an array io contain said probe sequences. 


38, A method of analyzing a plurality of nucleic acid samples, comprising: 

treating a first nucleic acid sample according to a defined procedure that produces a 
10 first population of fragments, the collective sequences of the fragments comprising a 

s —i 

|q subset of the collective sequences present in the first nucleic acid sample, 

determining abundance or composition of a subset of the population of the first 
*p population of fragments; 

j?j treating a second nucleic acid sample according to the defined procedure to produce 

^15 a second population of fragments containing corresponding fragments to the fragments in 
p the first population; 

determining abundance or composition of a subset of fragments in the second 
population having sequences corresponding to the subset of fragments in the first 
population. 



